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Background; Single-nucleotide polynnorphisnn (SNP) analysis is a powerful strategy for large-scale molecular population studies 
examining phylogenetic relationships among bacterial strains. Mycobacterial interspersed repetitive units-variable number of tan- 
dem repeats (MIRU-VNTR) can be easily digitized to share data among laboratories. This study applied SNP and MIRU-VNTR analy- 
ses for molecular strain typing oi Mycobacterium tuberculosis isolates collected throughout Korea. 

Methods: We studied 102 clinical M tuberculosis isolates, including 6 paired strains, collected from 1 1 university hospitals in Korea 
in 2008 and 2009. SNPs were detected using hairpin primer assays, and then, MIRU-VNTR analysis was performed. 
Results: Thirty-five SNPs contained polymorphisms that helped differentiate the 96 tested isolates. The isolates were classified into 
1 5 clusters. The Beijing family strains were distributed within closely related clusters in the SNP dendrogram. For MIRU-VNTR analy- 
sis, the 96 isolates were divided into 1 2 groups. The discriminatory index in 8 of these groups (MIRU-1 0, -23, -26, and -31 ; ETR-A, -B, 
-C, and -F) was high (Hunter-Gaston diversity index > 0.6). Unlike the SNP method, MIRU-VNTR analysis did not identify any nota- 
ble localizations of Beijing or non-Beijing family isolates in specific clusters. 

Conclusions: SNP and MIRU-VNTR analyses are surrogate molecular strain-typing methods for M tuberculosis in Korea where Bei- 
jing family isolates are predominant. 

Key Words: Mycobacterium tuberculosis, Single-nucleotide polymorphism, Mycobacterial interspersed repetitive units-variable 
number of tandem repeats 



INTRODUCTION 

Molecular typing methods have been extensively applied 
to understand the genetic variations in and phylogenetic re- 
lationships among various Mycobacterium tuberculosis 
strains, particularly the strains responsible for large-scale 
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infection outbreaks and the drug-resistant strains, and to 
differentiate re-infection from relapse [1-3] . 

Detection of the IS67 JO-restriction fragment length poly- 
morphism (RFLP) is considered the gold standard for ge- 
notyping M. tuberculosis, because this RFLP has high dis- 
criminatory power [4] . However, RFLP has some drawbacks; 
a long culture time is required to obtain enough DNA, and it 
is time- and labor-intensive [5]. Unlike the IS6J JO genotyp- 
ing method, spoUgotyping, which is based on polymorphisms 
of direct repeat loci, is simple and rapid because it is PCR- 
based [6, 7]. One limitation of this method is its inability to 
discriminate Beijing family strains. This renders the method 
unsuitable for strain typing as an epidemiologic tool in Ko- 
rea where the Beijing family strains are predominant [8, 9] . 

Recently, genome sequence data of 4 M. tuberculosis com- 
plex strains have become available, and this has enabled the 
development of a molecular method using single-nucleotide 
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polymorphisms (SNPs) to identify M. tuberculosis strains 
[10, 11]. These SNPs can be used as a precise tool in phylo- 
genetic studies [12-14]. SNP analysis also provides a power- 
ful strategy for large-scale molecular population studies ex- 
amining phylogenetic relationships among bacterial strains 
[15]. However, there is no standard SNP analysis method for 
estimating genetic relationships between strains isolated 
during large-scale outbreaks of infection or drug-resistant 
strains. 

Mycobacterial interspersed repetitive units -variable num- 
ber of tandem repeats (MIRU-VNTR), another strain-typ- 
ing method, has been developed in the past decade [2] . This 
method is considerably faster, requires only small amounts 
of DNA, and can be easily digitized to share data among 
laboratories [5, 16]. Thus, SNP and MIRU-VNTR analyses 
are good alternative methods for M. tuberculosis molecular 
strain typing. 

This study appUed SNP and MIRU-VNTR analyses to M. 
tuberculosis isolates collected throughout Korea. 

MATERIALS AND METHODS 

1 . Bacterial isolates 

We studied 102 clinical M. tuberculosis isolates collected 
from 11 university hospitals in Korea in 2008 and 2009. 
Among the isolated strains, 96 were collected from single 
cultures performed in 96 patients, and 6 were collected from 
different cultures performed in the same patients. Epidemi- 
ologic distribution of the 96 isolates had already been char- 
acterized by IS6JJ0-RFLP, and all isolates were collected 
from epidemiologicaHy unrelated patients [17]. The 6 paired 
strains were used to evaluate the reproducibility of the strain- 
typing methods used in this study. 

2. SNP analysis 

We selected 45 SNPs previously demonstrated to provide 
high resolution between cUnical isolates [18]. Positions of 
each of the 45 SNPs and its nucleotides in M. tuberculosis 
H37Rv are presented in Table 1. SNPs were detected using 
hairpin primer (HP) assays as described previously [19]. 
DNA including a target nucleotide in each primer was am- 
plified using the HP assay. A wild-type HP primer and dif- 
ferent mutant HP primer were used in a complementary 
fashion. Primer sequences are pubUshed in a previous paper 
and are not repeated here [19]. In brief the ampUfication 
protocol was as follows: stage 1, 95°C for 10 min, 70°C for 
30 sec; stage 2, 72°C for 30 sec, 95°C for 20 sec, 69°C for 30 
sec, lowering 1° in the last step for every cycle during 10 cy- 
cles; and stage 3, 72°C for 30 sec, 95°C for 20 sec, and 60°C 



for 30 sec; this was repeated 40 times. PGR products were 
analyzed on a 2% agarose gel. 

3. MIRU-VNTR 

MIRU-VNTR was performed as described previously 
[20, 21]. Briefly, isolates were genotyped by PGR amplifica- 
tion of the 12 MIRU-VNTR loci (MIRU-02, -04, -10, -16, 
-20,-23, -24, -26, -27, -31, -39, and -40) and 4 exact tandem 
repeat (ETR) loci (ETR-A, -B, -G, -F). The amplification 
protocol consisted of 30 cycles of 30 sec at 95°G, 30 sec at 
61°G, and 1 min at 72°G. PGR products were analyzed on a 
2% agarose gel, and the number of tandem repeats was cal- 
culated. 

4. IS67 70-RFLP 

The IS6JJ0-RFLP patterns of the 96 isolates have been 
pubhshed previously [17]. An additional 6 stains were sub- 
jected to IS6J JO-RFLP analysis, which was performed using 
an internationally standardized method [4]. 

5. Phylogenetic analysis 

Phylogenetic analysis was performed using the complete 

Table 1. Position of 45 single-nudeotide polymorphisms and identification 
of their nucleotides in Mycobacterium tuberculosis H37Rv 



Position 


Nucleotide 


Position 


Nucleotide 


37031 


C 


2223680 


G 


43943 


A 


2376133 


A 1 


92197 


T 


2462869 


G 


220048 


C 


2532614 


<^ 1 


311611 


G 


2627946 


A 


519804 


G 


2825579 


T 1 


797736 


C 


2891265 


C 


909164 


C 


2990037 


c 1 


918314 


T 


3207247 


A 


923063 


T 


3438383 


G J 


949219 


C 


3440461 


T 


1068149 


T 


3440539 


A 1 


1163132 


T 


3450722 


T 


1191859 


T 


3455683 


^ 1 


1294396 


C 


3544707 


T 


1477596 


C 


3783054 


e 1 


1548147 


G 


4024270 


T 


1692067 


A 


4119243 


T 1 


1692683 


C 


4137829 


C 


1884695 


G 


4254003 


G 1 


1892015 


T 


4255919 


A 


1952599 


C 


4280708 


G 1 


2158580 


G 
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data of 96 isolates for SNP analysis at 45 loci and for MIRU- 
VNTR analysis at 16 loci. A dendrogram of SNP and MIRU- 
VNTR analyses was constructed from the UPGMA cluster 
based on the overall similarity of each nucleotide polymor- 
phism (http://genomes.urv.cat/UPGMA/). For discrimina- 
tion analysis of MIRU- VNTR, the Hxmter-Gaston diversity 
index (HGDI) was calculated as described elsewhere [22] 
and used for comparison of the discriminatory power of 
each VNTR locus. 

RESULTS 

1 . Evaluation and phylogenetic analysis of SNPs 

Among the 45 SNPs, 35 had polymorphisms that could 
be used to differentiate the 96 tested isolates. The other 10 
SNPs did not contain polymorphisms. Based on SNP results, 
isolates were classified into 15 clusters (Fig. 1). Cluster 1, the 
most populated cluster, contained 19 isolates, followed by 
cluster 4 with 17 isolates. Three clusters consisted of only 1 
isolate each (Table 2). As shown in Fig. 1, nearly all Beijing 
family strains, which are defined and identified by IS6110 
RFLP data [17], were distributed within closely related clus- 
ters in the SNP dendrogram. Seventy-four of the 76 Beijing 
family strains were allocated to SNP clusters 1 through 12. 
The remaining 2 isolates were allocated to clusters 13 and 
14. Nearly all 14 isolates of the K family, a subfamily of the 
Beijing family identified in a previous study [17], were allo- 
cated to clusters 4 and 6. 




Fig. 1. Distance-based neighbor joining phylogenetic treeof 96/Myra6flffe- 
rium tuberculosis isolates based on 35 single-nucleotide polymorphisms re- 
veals 15 clusters (indicated by dotted circles). Each IS6770-restriction frag- 
ment length polymorphism type is indicated by a closed circle, quadrangle, 
or arrow. 



2. Phylogenetic analysis of VNTR 

The 96 isolates showed amplification products from 16 
VNTR loci and contained at least 1 copy of each locus. Re- 
sults of allelic diversity and HGDI testing are summarized 
in Table 3. Among the 16 loci, the discriminatory index in 8 
(MIRU- 10, -23, -26 and -31; ETR-A, -B, -C, and -F) was high 
(HGDI > 0.6) according to the definition of Sola et al. [23] . 
Six loci (MIRU-4, -16, -24, -27, -39 and -40) were dispersed 
anddiscriminatedmoderately well (0.3 < HGDI < 0.6). Two 
loci (MIRU-2 and -20) were poorly discriminated (HGDI 
< 0.3). The complete data were divided into 12 groups. The 
largest group was group 2, which included 15 isolates. Group 
11 contained the only unique pattern (Fig. 2). Unlike the 
SNP method, MIRU- VNTR did not show notable localiza- 
tions of Beijing or non-Beijing family isolates in specific 
clusters. 

3. Reproducibility of SNP and VNTR analyses 

The 6 paired isolates produced exactly the same results in 
IS6J70-RFLP SNR and VNTR patterns for a given patient 
(data not shown), demonstrating that the 2 methods are re- 
producible. 

DISSCUSION 

SNP analysis was developed after whole-genome sequence 
data became available and has been for genome- scale analy- 
sis to estimate overall chromosomal relationships among 
isolates [24, 25]. SNP analysis is a powerful strategy for ex- 




Fig. 2. Distance-based neighbor joining phylogenetic tree % Mycobacte- 
rium tuberculosis isolates based on based on mycobacterial interspersed re- 
petitive units-variable number of tandem repeats of 16 loci reveals 12 clus- 
ters (dotted circles). Each IS6770-restriction fragment length polymorphism 
type is indicated by a closed circle, quadrangle, or arrow. 
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Table 2. SNP clusters of 1 02 clinical isolates oWycobacterium tuberculosis collected throughout Korea 

Single-nucleotide polymorptiism* 



clusters 


IN Ul 

isolates 


37031 


43943 


92197 


311611 


519804 


797736 


909164 


918314 


1068149 


1191859 


1294396 


1477596 


1548147 


1692067 


1692683 


1884695 


1892015 


1952599 


1 


19 


G/c 


A/g 


G 


T 


G 


T/c 


T/c 


7 


c 




C/t 


C/t 


G 


A 


C/a 


G/a 


T/c 


c 


2 


8 


G 


A 


G 


J 


G 


7 


7 


7 


c 




c 


T/C 


G 


A 


c 


G 


7 


T/c 


3 


2 


G 


A 


G 


T 


G/A 


T 


T 


7 


C 




c 


T 


G 


A 


C 


G 


7 


C 


4 


17 


G 


A 


G 


T 


G 


T 


C 


T 


C 


T/c 


c 


T/c 


G 


A 


C/a 


G 


C/t 


C 


5 


7 


G/c 


A 


G 


T/g 


G 


T 


C 


T 


C 




c 


T/c 


G 


A 


C 


G 


C 


C 


6 


4 


G 


G 


G 


T 


G 


T 


T/C 


T 


C 




c 


C 


G 


A 


C 


G 


T/c 


T/c 


7 


4 


G 


A 


G 


T 


G 


T 


T 


T 


c 




c 


C/T 


G/a 


A 


C/a 


G 


T 


C 


8 


8 


G/c 


A 


G 


T/g 


G 


T 


C/t 


T 


c 


T/c 


c 


T 


G 


A/g 


C/a 


G 


T 


C 


9 


4 


G 


A 


G 


T/g 


G 


T 


T/c 


T 


c 


T/C 


c 


T/c 


G 


A 


C 


G 


C/t 


C 


10 


1 


G 


A 


G 


T 


G 


T 


T 


T 


c 




c 


T 


G 


A 


C 


G 


T 


C 


11 


1 


G 


A 


G 


T 


G 


T 


T 


T 


c 




c 


C 


G 


A 


A 


G 


T 


C 


12 


1 


C 


A 


G 


G 


G 


T 


T 


T 


c 




c 


C 


G 


G 


C 


G 


C 


C 


13 


6 


C/g 


A/g 


G 


T 


G/a 


C 


C/t 


T/c 


c/t 




c 


C 


A/g 


A 


C 


G/a 


T 


C/t 


14 


9 


G 


A 


T/g 


T/g 


G 


C/t 


C/t 


T/c 


c/t 




c/t 


C 


G/a 


A/g 


C/a 


G 


T 


C/t 


15 


5 


G/c 


A/g 


G/t 


G/t 


G/a 


C 


C 


T/c 


c 




c 


C 


G 


A 


C 


G/a 


T 


c 



amining phylogenetic relationships among bacterial strains 
and is easy to perform [26, 27] . Among various methodolo- 
gies described previously, we chose the HP assay for this 
study to improve PGR specificity by decreasing nucleotide 
mismatching [19, 28]. Additionally, the material costs are 
lower for this method than for a sequencing-based method 
[19]. Previous studies have demonstrated that all tested M. 
tuberculosis strains were distinguishable by SNP analysis, 
and therefore, the authors proposed that SNP analysis was 
an ideal tool to investigate biological variations among clin- 
ical M. tuberculosis isolates [15, 18]. FiHiol et al. [18] identi- 
iied a minimal number of 45 highly informative SNPs by 
analyzing 212 SNPs against 323 clinical M. tuberculosis and 
Mycobacterium bovis isolates. Our SNP analysis was based 
on this set of 45 SNPs. In this study, however, only 35 SNP 
loci could be used to categorize our 96 M. tuberculosis iso- 
lates. Interestingly, nearly all Beijing family strains, includ- 
ing those of the K family, were allocated to closely related 
clusters. Our results indicate that SNP analysis using 35 loci 
is useful for phylogenetic analysis of clinical M. tuberculosis 
isolates from Korea. This method is effective, easy, and eco- 
nomical. In the 96 M. tuberculosis isolates examined, 10 of 
45 SNPs were identical. We think that this difference was due 
to the difference in strain distribution between the current 
and previous studies [18]. Our study included a larger pro- 
portion of Beijing family isolates, whereas only 15 Beijing 
family isolates, among the 246 tested isolates, were included 



(Continued to the next page) 

in the previous study [18]; moreover, those 15 isolates were 
classified as a single SNP group. Therefore, some SNPs 
proven to be polymorphic between strains isolated in 1 re- 
gion may not be suitable to discriminate strains originating 
in other regions. 

The MIRU-VNTR typing method is technically more flex- 
ible and faster than IS6J JO-RFLP typing, and the results are 
expressed as numerical codes, which are easy to compare 
and exchange [29, 30] . A previous report demonstrated that 
MIRU-VNTR analysis using weU-selected loci is useful for 
analyzing clinical M. tuberculosis isolates from Korea, where 
the Beijing family is predominant [3 1 ] . In the current MIRU- 
VNTR study using 96 isolates, we identified 2 interesting 
findings. First, in contrast to SNP results, MIRU-VNTR 
patterns were not related to IS6J JO-RFLP patterns. As shown 
in Fig. 2, isolates of Beijing and non-Beijing families were 
distributed in almost all clusters. This means that changes 
in MIRU-VNTR loci developed independently of changes 
in IS6110 insertion. Hanekom et al. [32] described discor- 
dant results between MIRU-VNTR and IS6110 RFLP and 
suggested that the degree of discordance was dependent on 
the genetic distance between isolates. However, SNP analy- 
sis resulted in clusters closely related to IS5iJ0-RFLP pat- 
terns. Filliol et al. [18] described similar close clustering of 
Beijing family strains by both the SNP and spoUgotyping 
methods. We could not locate M. tuberculosis molecular 
strain-typing data that had been obtained by simultaneously 
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Table 2. {Continued from the previous page) SNP clusters of 102 clinical isolates oWycobacterium tuberculosis collected throughout Korea 

Single-nucleotide polymorphism* 



clusters 


Not 
isolates 


2223680 


2462869 


2532614 


2825579 


2990037 


3207247 


3438383 


3440461 


3440539 


3450722 


3544707 


3783054 


4024270 


4137829 


4254003 


4255919 


4280708 


1 


19 


G 


A/g 


A/G 




c 


A/t 


G 


G 


A 


T/c 


T/c 


G/a 


c 


C/t 


G/t 


G 


A/g 


2 


8 


G 


A 


G/a 




c 


T/a 


G 


G 


A/g 


T/c 


7 


G 


C/t 


C/t 


G 


G 


A 


3 


2 


r 
u 


ft 

A 


A 






A 


r 
u 


r 
u 


ft 

A 


T 
1 


T/r 


r 
u 


T 

1 


r 
L 


r 
u 


r 
u 


ft 

A 


4 


17 


G 


A 


A 




C 


A 


G 


G 


A/g 


C/t 


T 


G 


c 


C/t 


G 


G 


A 


5 


7 


G 


A 


G 




C 


A 


G 


G 


A/g 


c 


T/c 


G 


c 


T/c 


G 


G 


A 


6 


4 


G 


A 


A 




c 


A 


G 


G 


A 


c 


T 


G 


c 


C 


G 


G/a 


A 


7 


4 


G 


A/g 


G 


T/g 


c 


A 


G 


G 


A 


c 


T 


G 


c 


T/C 


G 


G 


A 


8 


8 


G/c 


G/a 


A/g 




c 


A 


G 


G 


A 


c/t 


T/c 


G 


c 


C/t 


G 


G/a 


A 


9 


4 


G 


A/g 


A 




c 


A 


A 


G 


A/g 


c 


T 


G/a 


c 


C 


G 


G 


A 


10 


1 


G 


A 


A 




T 


T 


A 


G 


A 


c 


T 


G 


T 


C 


G 


A 


A 


11 


1 


C 


A 


G 




T 


T 


G 


G 


A 


T 


T 


G 


C 


C 


G 


G 


A 


12 


1 


G 


A 


G 




C 


A 


G 


G 


A 


T 


C 


G 


T 


C 


G 


G 


A 


13 


6 


G 


A 


G 


G/t 


C/t 


A 


G 


G 


G/a 


C 


T/c 


A 


C 


C 


T 


G/a 


G 


14 


9 


G 


A 


G 


T/g 


C 


A 


G 


G 


A/g 


C/t 


T/c 


A 


C 


c 


T 


G/a 


G 


15 


5 


G/c 


A 


G 




C 


A 


G 


T 


A 


C 


T 


A 


C 


c 


T 


G 


G 



*When 2 nucleotides are specified, a capital letter indicates the major population, a lower case indicates the minor population, and 2 capital letters indicate 2 
major populations. 

Abbreviation: SNP, single-nucleotide polymorphism. 



using IS6J JO-RFLP, SNPs, and MIRU-VNTR. Here, we de- 
scribe a close relationship between SNP and IS6110-KFLP 
and independent results for MIRU-VNTR and IS6i JO-RFLP 
This means that, unlike IS6J JO-RFLP, spoligotyping, or SNP, 
MIRU-VNTR shows high discriminatory power for typing 
M. tuberculosis strains in areas where the Beijing family is 
highly prevalent. Second, the discriminatory power of 
MIRU-VNTR in some loci varied from that described in a 
previous study conducted with 81 isolates from Korea [31]. 
In that study Yun et al. [31] reported that MIRU-23 and 
ETR-A, -B, and -C had low discriminatory power (HGDI 
< 0.3). In our study, however, these 4 loci were highly dis- 
criminatory (HGDI > 0.6), with MIRU-23 having the sec- 
ond highest discriminatory power (HGDI = 0.75). The high- 
est discriminatory power was shown by MIRU-26 in both 
studies (HGDI > 0.80). The discriminatory power of a spe- 
cific locus is inevitably affected by the selection of isolates, 
and the selection criteria were different between the 2 stud- 
ies. In the previous study, all isolates were collected from 
Pusan National University Hospital or provided by the Ko- 
rean Institute of Tuberculosis, so the distribution of patients' 
place of residences might have been restricted [31]. In this 
study, all 96 isolates were collected from 1 1 university hos- 
pitals located in various areas, and the number of isolates 
from each hospital was roughly proportional to the recorded 



Table 3. Variable repeat numbers in each VNTR locus and allelic diversity 
among 96 clinical isolates of Mycobacterium tuberculosis collected through- 
out Korea 



VNTR loci 








N of VNTR repeats 




HGDI* 


1 


2 


3 


4 5 6 7 


8 


9 


MIRU-2 


88 


9 


1 






0.25 


MIRU-4 


54 


36 


8 






0.58 ] 


MIRU-10 


65 


8 


20 






062 


MIRU-16 


31 


62 


4 


1 




0.59 ] 


MIRU-20 


90 


8 








012 


IVIIRU-23 


6 


4 


41 


24 9 14 




0Z5 


MIRU-24 


66 


17 


8 






0.58 


IVIIRU-26 


22 


3 


34 


5 11 11 9 


1 


2 086 


MIRU-27 


67 


19 


12 






0.59 


IVIIRU-31 


17 


42 


18 


14 7 




072 j 


MIRU-39 


61 


32 


5 






0.59 


IVIIRU-40 


66 


21 


8 


3 




0.60 ] 


ETR-A 


1 


27 


51 


15 4 




072 


ETR-B 


49 


14 


20 


15 




mi 


ETR-C 


1 


20 


36 


26 15 




073 


ETR-F 


18 


31 


29 


20 




062 



*Underlining indicates loci showing high discriminatory power, as defined 
bySolaetal.[23]. 

Abbreviations: VNTR, variable number of tandem repeats; HGDI, Hunter- 
Gaston diversity index; MIRU, mycobacterial interspersed repetitive units; 
ETR, exact tandem repeat. 
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patient numbers in the area served by that institution. There- 
fore, our study represents MIRU-VNTR performance more 
accurately. Combining the results of both the studies, we 
found that MIRU-10, -26, -31, and ETR-F show a high dis- 
criminatory power for molecular strain typing of M. tuber- 
culosis isolates from Korea. Our assessment of the clinical 
utility of other loci, such as ETR-A, -B, and -C, that showed 
good discriminatory power in only 1 study is reserved until 
further data are obtained. 

We confirmed the reproducibility of SNP and MIRU- 
VNTR analyses using 6 pairs of strains from different cul- 
tures of the same patients. Both SNP and MIRU-VNTR can 
discriminate clinical M. tuberculosis isolates in Korea, where 
Beijing family strains are highly prevalent. Both methods 
are PCR-based and early apphcation to clinical specimens is 
possible. Strain typing is often requested to detect outbreaks 
or contamination, and rapid results are required to control 
these phenomena. Although IS6J JO-RFLP has high discrim- 
inatory power, it is not useful for these purposes. Therefore, 
SNP and MIRU-VNTR can be used as surrogate molecular 
strain-typing methods for M. tuberculosis in Korea. 
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